Analysis of first prototype universal intelligence tests: evaluating and comparing AI algorithms and humans

Authors

Javier Insa-Cabrera

José Hernández-Orallo

Abstract

Artificial Intelligence (AI) has always tried to emulate the greatest virtue of humans: their intelligence. However, despite many efforts to reach this goal, it is clear at a glance that the intelligence of AI systems barely resembles that of humans. Today, the available methods for assessing AI systems rely on empirical techniques that measure the performance of algorithms on specific tasks (e.g., playing chess, solving mazes or landing a helicopter). These methods are not appropriate for evaluating the general intelligence of AI systems and are even less suited to comparing it with human intelligence. The ANYNT project has designed a new evaluation method that tries to assess AI systems using well-known computational notions and problems that are as general as possible. This new method serves to assess general intelligence (which allows us to learn how to solve any new kind of problem we face) and not only to evaluate performance on a set of specific tasks. The method is not limited to measuring the intelligence of algorithms: it is meant to assess any intelligent system (human beings, animals, AI, aliens?, ...) and to place their results on the same scale so that they can be compared. In the future, this approach will allow us to evaluate and compare any kind of intelligent system, whether already known or yet to be built or found, be it artificial or biological. This master's thesis aims to ensure that the new method provides consistent results when evaluating AI algorithms. This is done through the design and implementation of prototypes of universal intelligence tests and their application to different intelligent systems (AI algorithms and human beings).
From this study, we analyze whether the results obtained by two different intelligent systems are properly located on the same scale, and we propose changes and refinements to these prototypes so that a truly universal intelligence test can be achieved in the future.


Similar resources

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test

In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general approach to intelligence evaluation of AI algorithms is feasible. This top-down (theory-derived) approach is based on a generation of environments under a Solomonoff universal distribution instead of using a pre-defi...


Network Planning Using Iterative Improvement Methods and Heuristic Techniques

The problem of minimum-cost expansion of power transmission network is formulated as a genetic algorithm with the cost of new lines and security constraints and Kirchhoff’s Law at each bus bar included. A genetic algorithm (GA) is a search or optimization algorithm based on the mechanics of natural selection and genetics. An applied example is presented. The results from a set of tests carried ...


Comparing Humans and AI Agents

Comparing humans and machines is one important source of information about both machine and human strengths and limitations. Most of these comparisons and competitions are performed in rather specific tasks such as calculus, speech recognition, translation, games, etc. The information conveyed by these experiments is limited, since it portrays that machines are much better than humans at some d...


Measuring (machine) intelligence universally: An interdisciplinary challenge

Artificial intelligence (AI) is having a deep impact on the way humans work, communicate and enjoy their leisure time. AI systems have been traditionally devised to solve specific tasks, such as playing chess, diagnosing a disease or driving a car. However, more and more AI systems are now being devised to be generally adaptable, and learn to solve a variety of tasks or to assist humans and org...


Lessons Learned from Prototyping Parallel Computer Architectures for AI Algorithms

Since many years algorithms from the field of artificial intelligence (AI) have been targeted for parallelization, i.e., partitioning the search problem and distributing the subproblems among multiple processing nodes. This paper reports on our experience in parallelizing and distributing AI algorithms, i.e., the design and prototype implementation of parallel computer architectures for AI algorit...



Journal: CoRR

Volume: abs/1109.5072

Publication date: 2011
